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Started: 
Finished: 
Elapsed: 
Total Warnings : 
Total Errors: 
No. of SeqIDs Defined: 
Actual SeqID Count: 

Error code Error Description 

This error has occured more than 20 

W 213 Artificial or Unknown found in <2 13> 

W 213 Artificial or Unknown found in <213> 

W 213 Artificial or Unknown found in <213> 
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This error has occured more than 20 
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SEQUENCE LISTING 



<110> CHIRON CORPORATION 
HARDY, Stephen F 
DONNELLY, III, John J 
ZUR MEGEDE, Jan T 

<120> VECTORS FOR EXPRESSION OF HML-2 POLYPEPTIDES 

<130> PP19482 .0007 

<140> 10587032 
<141> 2009-10-13 

<15 0> PCT/US0 3/18 66 6 
<151> 2003-06-13 

<150> 60/388831 
<151> 2002-06-16 

<150> 60/472189 
<151> 2003-05-20 

<160> 83 

<170> Patentln, version 3.5 

<210> 1 
<211> 1998 
<212> DNA 

<213> Human endogenous retrovirus, K family (HERV-K) 
<400> 1 

atggggcaaa ctaaaagtaa aattaaaagt aaatatgcct cttatctcag ctttattaaa 60 

attcttttaa aaagaggggg agttaaagta tctacaaaaa atctaatcaa gctatttcaa 120 

ataatagaac aattttgccc atggtttcca gaacaaggaa ctttagatct aaaagattgg 180 

aaaagaattg gtaaggaact aaaacaagca ggtaggaagg gtaatatcat tccacttaca 2 40 

gtatggaatg attgggccat tattaaagca gctttagaac catttcaaac agaagaagat 300 

agcgtttcag tttctgatgc ccctggaagc tgtataatag attgtaatga aaacacaagg 360 

aaaaaatccc agaaagaaac ggaaggttta cattgcgaat atgtagcaga gccggtaatg 420 

gctcagtcaa cgcaaaatgt tgactataat caattacagg aggtgatata tcctgaaacg 4 80 

ttaaaattag aaggaaaagg tccagaatta gtggggccat cagagtctaa accacgaggc 540 

acaagtcctc ttccagcagg tcaggtgcct gtaacattac aacctcaaaa gcaggttaaa 600 

gaaaataaga cccaaccgcc agtagcctat caatactggc ctccggctga acttcagtat 6 60 

cggccacccc cagaaagtca gtatggatat ccaggaatgc ccccagcacc acagggcagg 720 

gcgccatacc ctcagccgcc cactaggaga cttaatccta cggcaccacc tagtagacag 780 

ggtagtaaat tacatgaaat tattgataaa tcaagaaagg aaggagatac tgaggcatgg 8 40 

caattcccag taacgttaga accgatgcca cctggagaag gagcccaaga gggagagcct 900 

cccacagttg aggccagata caagtctttt tcgataaaaa agctaaaaga tatgaaagag 960 

ggagtaaaac agtatggacc caactcccct tatatgagga cattattaga ttccattgct 1020 

catggacata gactcattcc ttatgattgg gagattctgg caaaatcgtc tctctcaccc 1080 

tctcaatttt tacaatttaa gacttggtgg attgatgggg tacaagaaca ggtccgaaga 1140 

aatagggctg ccaatcctcc agttaacata gatgcagatc aactattagg aataggtcaa 1200 

aattggagta ctattagtca acaagcatta atgcaaaatg aggccattga gcaagttaga 1260 

gctatctgcc ttagagcctg ggaaaaaatc caagacccag gaagtacctg cccctcattt 1320 

aatacagtaa gacaaggttc aaaagagccc tatcctgatt ttgtggcaag gctccaagat 1380 

gttgctcaaa agtcaattgc tgatgaaaaa gcccgtaagg tcatagtgga gttgatggca 1440 



tatgaaaacg ccaatcctga gtgtcaatca gccattaagc cattaaaagg aaaggttcct 1500 

gcaggatcag atgtaatctc agaatatgta aaagcctgtg atggaatcgg aggagctatg 1560 

cataaagcta tgcttatggc tcaagcaata acaggagttg ttttaggagg acaagttaga 1620 

acatttggaa gaaaatgtta taattgtggt caaattggtc acttaaaaaa gaattgccca 1680 

gtcttaaata aacagaatat aactattcaa gcaactacaa caggtagaga gccacctgac 1740 

ttatgtccaa gatgtaaaaa aggaaaacat tgggctagtc aatgtcgttc taaatttgat 1800 

aaaaatgggc aaccattgtc gggaaacgag caaaggggcc agcctcaggc cccacaacaa 1860 

actggggcat tcccaattca gccatttgtt cctcagggtt ttcagggaca acaaccccca 1920 

ctgtcccaag tgtttcaggg aataagccag ttaccacaat acaacaattg tcccccgcca 1980 

caagcggcag tgcagcag 1998 

<210> 2 
<211> 2001 
<212> DNA 

<213> Human endogenous retrovirus, K family (HERV-K) 
<400> 2 

atggggcaaa ctaaaagtaa aattaaaagt aaatatgcct cttatctcag ctttattaaa 60 

attcttttaa aaagaggggg agttaaagta tctacaaaaa atctaatcaa gctatttcaa 120 

ataatagaac aattttgccc atggtttcca gaacaaggaa ctttagatct aaaagattgg 180 

aaaagaattg gtaaggaact aaaacaagca ggtaggaagg gtaatatcat tccacttaca 2 40 

gtatggaatg attgggccat tattaaagca gctttagaac catttcaaac agaagaagat 300 

agcgtttcag tttctgatgc ccctggaagc tgtataatag attgtaatga aaacacaagg 360 

aaaaaatccc agaaagaaac ggaaggttta cattgcgaat atgtagcaga gccggtaatg 420 

gctcagtcaa cgcaaaatgt tgactataat caattacagg aggtgatata tcctgaaacg 4 80 

ttaaaattag aaggaaaagg tccagaatta gtggggccat cagagtctaa accacgaggc 540 

acaagtcctc ttccagcagg tcaggtgcct gtaacattac aacctcaaaa gcaggttaaa 600 

gaaaataaga cccaaccgcc agtagcctat caatactggc ctccggctga acttcagtat 660 

cggccacccc cagaaagtca gtatggatat ccaggaatgc ccccagcacc acagggcagg 720 

gcgccatacc ctcagccgcc cactaggaga cttaatccta cggcaccacc tagtagacag 780 

ggtagtaaat tacatgaaat tattgataaa tcaagaaagg aaggagatac tgaggcatgg 8 40 

caattcccag taacgttaga accgatgcca cctggagaag gagcccaaga gggagagcct 900 

cccacagttg aggccagata caagtctttt tcgataaaaa agctgaaaga tatgaaagag 9 60 

ggagtaaaac agtatggacc caactcccct tatatgagga cattattaga ttccattgct 1020 

catggacata gactcattcc ttatgattgg gagattctgg caaaatcgtc tctctcaccc 1080 

tctcaatttt tacaatttaa gacttggtgg attgatgggg tacaagaaca ggtccgaaga 1140 

aatagggctg ccaatcctcc agttaacata gatgcagatc aactattagg aataggtcaa 1200 

aattggagta ctattagtca acaagcatta atgcaaaatg aggccattga gcaagttaga 1260 

gctatctgcc ttagagcctg ggaaaaaatc caagacccag gaagtacctg cccctcattt 1320 

aatacagtaa gacaaggttc aaaagagccc tatcctgatt ttgtggcaag gctccaagat 1380 

gttgctcaaa agtcaattgc tgatgaaaaa gcccgtaagg tcatagtgga gttgatggca 1440 

tatgaaaacg ccaatcctga gtgtcaatca gccattaagc cattaaaagg aaaggttcct 1500 

gcaggatcag atgtaatctc agaatatgta aaagcctgtg atggaatcgg aggagctatg 1560 

tataaagcta tgcttatggc tcaagcaata acaggagttg ttttaggagg acaagttaga 1620 

acatttggaa gaaaatgtta taattgtggt caaattggtc acttaaaaaa gaattgccca 1680 

gtcttaaata aacagaatat aactattcaa gcaactacaa caggtagaga gccacctgac 1740 

ttatgtccaa gatgtaaaaa aggaaaacat tgggctagtc aatgtcgttc taaatttgat 1800 

aaaaatgggc aaccattgtc gggaaacgag caaaggggcc agcctcaggc cccacaacaa 1860 

actggggcat tcccaattca gccatttgtt cctcagggtt ttcagggaca acaaccccca 1920 

ctgtcccaag tgtttcaggg aataagccag ttaccacaat acaacaattg tcccccgcca 1980 

caagcggcag tgcagcagta g 2001 



<210> 3 

<211> 2004 

<212> DNA 

<213> Human endogenous retrovirus, K family (HERV-K) 



<400> 3 

atggggcaaa ctaaaagtaa aactaaaagt aaatatgcct cttatctcag ctttattaaa 60 

attcttttaa aaagaggggg agttagagta tctacaaaaa atctaatcaa gctatttcaa 120 

ataatagaac aattttgccc atggtttcca gaacaaggaa ctttagatct aaaagattgg 180 

aaaagaattg gcgaggaact aaaacaagca ggtagaaagg gtaatatcat tccacttaca 2 40 

gtatggaatg attgggccat tattaaagca gctttagaac catttcaaac aaaagaagat 300 

agcgtttcag tttctgatgc ccctggaagc tgtgtaatag attgtaatga aaagacaggg 360 

agaaaatccc agaaagaaac agaaagttta cattgcgaat atgtaacaga gccagtaatg 420 

gctcagtcaa cgcaaaatgt tgactataat caattacagg gggtgatata tcctgaaacg 4 80 

ttaaaattag aaggaaaagg tccagaatta gtggggccat cagagtctaa accacgaggg 540 

ccaagtcctc ttccagcagg tcaggtgccc gtaacattac aacctcaaac gcaggttaaa 600 

gaaaataaga cccaaccgcc agtagcttat caatactggc cgccggctga acttcagtat 6 60 

ctgccacccc cagaaagtca gtatggatat ccaggaatgc ccccagcact acagggcagg 720 

gcgccatatc ctcagccgcc cactgtgaga cttaatccta cagcatcacg tagtggacaa 780 

ggtggtacac tgcacgcagt cattgatgaa gccagaaaac agggagatct tgaggcatgg 8 40 

cggttcctgg taattttaca actggtacag gccggggaag agactcaagt aggagcgcct 900 

gcccgagctg agactagatg tgaacctttc accatgaaaa tgttaaaaga tataaaggaa 960 

ggagttaaac aatatggatc caactcccct tatataagaa cattattaga ttccattgct 1020 

catggaaata gacttactcc ttatgactgg gaaagtttgg ccaaatcttc cctttcatcc 1080 

tctcagtatc tacagtttaa aacctggtgg attgatggag tacaagaaca ggtacgaaaa 1140 

aatcaggcta ctaagcccac tgttaatata gacgcagacc aattgttagg aacaggtcca 1200 

aattggagca ccattaacca acaatcagtg atgcagaatg aggctattga acaagtaagg 1260 

gctatttgcc tcagggcctg gggaaaaatt caggacccag gaacagcttt ccctattaat 1320 

tcaattagac aaggctctaa agagccatat cctgactttg tggcaagatt acaagatgct 1380 

gctcaaaagt ctattacaga tgacaatgcc cgaaaagtta ttgtagaatt aatggcctat 1440 

gaaaatgcaa atccagaatg tcagtcggcc ataaagccat taaaaggaaa agttccagca 150 0 

ggagttgatg taattacaga atatgtgaag gcttgtgatg ggattggagg agctatgcat 1560 

aaggcaatgc taatggctca agcaatgagg gggctcactc taggaggaca agttagaaca 1620 

tttgggaaaa aatgttataa ttgtggtcaa atcggtcatc tgaaaaggag ttgcccagtc 1680 

ttaaataaac agaatataat aaatcaagct attacagcaa aaaataaaaa gccatctggc 174 0 

ctgtgtccaa aatgtggaaa aggaaaacat tgggccaatc aatgtcattc taaatttgat 1800 

aaagatgggc aaccattgtc gggaaacagg aagaggggcc agcctcaggc cccccaacaa 18 60 

actggggcat tcccagttca actgtttgtt cctcagggtt ttcaaggaca acaaccccta 1920 

cagaaaatac caccacttca gggagtcagc caattacaac aatccaacag ctgtcccgcg 1980 

ccacagcagg cagcgccaca gtag 2 00 4 

<210> 4 
<211> 852 
<212> DNA 

<213> Human endogenous retrovirus, K family (HERV-K) 
<400> 4 

atggggcaaa ctaaaagtaa aattaaaagt aaatatgcct cttatctcag ctttattaaa 60 

attcttttaa aaagaggggg agttaaagta tctacaaaaa atctaatcaa gctatttcaa 120 

ataatagaac aattttgccc atggtttcca gaacaaggaa cttcagatct aaaagattgg 180 

aaaagaattg gtaaggaact aaaacaagca ggtaggaagg gtaatatcat tccacttaca 2 40 

gtatggaatg attgggccat tattaaagca gctttagaac catttcaaac agaagaagat 300 

agcatttcag tttctgatgc ccctggaagc tgtttaatag attgtaatga aaacacaagg 360 

aaaaaatccc agaaagaaac cgaaagttta cattgcgaat atgtagcaga gccggtaatg 420 

gctcagtcaa cgcaaaatgt tgactataat caattacagg aggtgatata tcctgaaacg 4 80 

ttaaaattag aaggaaaagg tccagaatta atggggccat cagagtctaa accacgaggc 540 

acaagtcctc ttccagcagg tcaggtgctc gtaagattac aacctcaaaa gcaggttaaa 600 

gaaaataaga cccaaccgca agtagcctat caatactgcc gctggctgaa cttcagtatc 6 60 

ggccaccccc agaaagtcag tatggatatc caggaatgcc cccagcacca cagggcaggg 720 

cgccatacca tcagccgccc actaggagac ttaatcctat ggcaccacct agtagacagg 7 80 

gtagtgaatt acatgaaatt attgataaat caagaaagga aggagatact gaggcatggc 8 40 

aattcccagt aa 852 



<210> 5 

<211> 666 

<212> PRT 

<213> Human endogenous retrovirus, K family (HERV-K) 

<400> 5 

Met Gly Gin Thr Lys Ser Lys lie Lys Ser Lys Tyr Ala Ser Tyr Leu 
15 10 15 

Ser Phe lie Lys lie Leu Leu Lys Arg Gly Gly Val Lys Val Ser Thr 

20 25 30 

Lys Asn Leu lie Lys Leu Phe Gin lie lie Glu Gin Phe Cys Pro Trp 
35 40 45 

Phe Pro Glu Gin Gly Thr Leu Asp Leu Lys Asp Trp Lys Arg lie Gly 
50 55 60 

Lys Glu Leu Lys Gin Ala Gly Arg Lys Gly Asn lie lie Pro Leu Thr 
65 70 75 80 

Val Trp Asn Asp Trp Ala lie lie Lys Ala Ala Leu Glu Pro Phe Gin 
85 90 95 

Thr Glu Glu Asp Ser Val Ser Val Ser Asp Ala Pro Gly Ser Cys lie 
100 105 110 

lie Asp Cys Asn Glu Asn Thr Arg Lys Lys Ser Gin Lys Glu Thr Glu 
115 120 125 

Gly Leu His Cys Glu Tyr Val Ala Glu Pro Val Met Ala Gin Ser Thr 
130 135 140 

Gin Asn Val Asp Tyr Asn Gin Leu Gin Glu Val lie Tyr Pro Glu Thr 
145 150 155 160 

Leu Lys Leu Glu Gly Lys Gly Pro Glu Leu Val Gly Pro Ser Glu Ser 
165 170 175 

Lys Pro Arg Gly Thr Ser Pro Leu Pro Ala Gly Gin Val Pro Val Thr 
180 185 190 

Leu Gin Pro Gin Lys Gin Val Lys Glu Asn Lys Thr Gin Pro Pro Val 
195 200 205 

Ala Tyr Gin Tyr Trp Pro Pro Ala Glu Leu Gin Tyr Arg Pro Pro Pro 

210 215 220 

Glu Ser Gin Tyr Gly Tyr Pro Gly Met Pro Pro Ala Pro Gin Gly Arg 

225 230 235 240 

Ala Pro Tyr Pro Gin Pro Pro Thr Arg Arg Leu Asn Pro Thr Ala Pro 
245 250 255 



Pro Ser Arg Gin Gly Ser Lys Leu His Glu lie lie Asp Lys Ser Arg 
260 265 270 



Lys Glu Gly Asp Thr Glu Ala Trp Gin Phe Pro Val Thr Leu Glu Pro 

275 280 285 



Met Pro Pro Gly Glu Gly Ala Gin Glu Gly Glu Pro Pro Thr Val Glu 

290 295 300 

Ala Arg Tyr Lys Ser Phe Ser lie Lys Lys Leu Lys Asp Met Lys Glu 
305 310 315 320 

Gly Val Lys Gin Tyr Gly Pro Asn Ser Pro Tyr Met Arg Thr Leu Leu 

325 330 335 

Asp Ser lie Ala His Gly His Arg Leu lie Pro Tyr Asp Trp Glu lie 
340 345 350 

Leu Ala Lys Ser Ser Leu Ser Pro Ser Gin Phe Leu Gin Phe Lys Thr 
355 360 365 

Trp Trp lie Asp Gly Val Gin Glu Gin Val Arg Arg Asn Arg Ala Ala 
370 375 380 

Asn Pro Pro Val Asn lie Asp Ala Asp Gin Leu Leu Gly lie Gly Gin 
385 390 395 400 

Asn Trp Ser Thr lie Ser Gin Gin Ala Leu Met Gin Asn Glu Ala lie 
405 410 415 

Glu Gin Val Arg Ala lie Cys Leu Arg Ala Trp Glu Lys lie Gin Asp 
420 425 430 

Pro Gly Ser Thr Cys Pro Ser Phe Asn Thr Val Arg Gin Gly Ser Lys 
435 440 445 

Glu Pro Tyr Pro Asp Phe Val Ala Arg Leu Gin Asp Val Ala Gin Lys 
450 455 460 

Ser lie Ala Asp Glu Lys Ala Arg Lys Val lie Val Glu Leu Met Ala 
465 470 475 480 

Tyr Glu Asn Ala Asn Pro Glu Cys Gin Ser Ala lie Lys Pro Leu Lys 
485 490 495 

Gly Lys Val Pro Ala Gly Ser Asp Val lie Ser Glu Tyr Val Lys Ala 
500 505 510 

Cys Asp Gly lie Gly Gly Ala Met Tyr Lys Ala Met Leu Met Ala Gin 
515 520 525 

Ala lie Thr Gly Val Val Leu Gly Gly Gin Val Arg Thr Phe Gly Arg 
530 535 540 

Lys Cys Tyr Asn Cys Gly Gin lie Gly His Leu Lys Lys Asn Cys Pro 
545 550 555 560 



Val Leu Asn Lys Gin Asn lie Thr lie Gin Ala Thr Thr Thr Gly Arg 
565 570 575 



Glu Pro Pro Asp Leu Cys Pro Arg Cys Lys Lys Gly Lys His Trp Ala 
580 585 590 



Ser Gin Cys Arg Ser Lys Phe Asp Lys Asn Gly Gin Pro Leu Ser Gly 
595 600 605 

Asn Glu Gin Arg Gly Gin Pro Gin Ala Pro Gin Gin Thr Gly Ala Phe 
610 615 620 

Pro lie Gin Pro Phe Val Pro Gin Gly Phe Gin Gly Gin Gin Pro Pro 

625 630 635 640 

Leu Ser Gin Val Phe Gin Gly lie Ser Gin Leu Pro Gin Tyr Asn Asn 
645 650 655 

Cys Pro Pro Pro Gin Ala Ala Val Gin Gin 
660 665 

<210> 6 
<211> 667 
<212> PRT 

<213> Human endogenous retrovirus, K family (HERV-K) 
<400> 6 

Met Gly Gin Thr Lys Ser Lys Thr Lys Ser Lys Tyr Ala Ser Tyr Leu 
15 10 15 

Ser Phe lie Lys lie Leu Leu Lys Arg Gly Gly Val Arg Val Ser Thr 
20 25 30 

Lys Asn Leu lie Lys Leu Phe Gin lie lie Glu Gin Phe Cys Pro Trp 
35 40 45 

Phe Pro Glu Gin Gly Thr Leu Asp Leu Lys Asp Trp Lys Arg lie Gly 
50 55 60 

Glu Glu Leu Lys Gin Ala Gly Arg Lys Gly Asn lie lie Pro Leu Thr 
65 70 75 80 

Val Trp Asn Asp Trp Ala lie lie Lys Ala Ala Leu Glu Pro Phe Gin 
85 90 95 

Thr Lys Glu Asp Ser Val Ser Val Ser Asp Ala Pro Gly Ser Cys Val 
100 105 110 

lie Asp Cys Asn Glu Lys Thr Gly Arg Lys Ser Gin Lys Glu Thr Glu 
115 120 125 

Ser Leu His Cys Glu Tyr Val Thr Glu Pro Val Met Ala Gin Ser Thr 
130 135 140 

Gin Asn Val Asp Tyr Asn Gin Leu Gin Gly Val lie Tyr Pro Glu Thr 
145 150 155 160 



Leu Lys Leu Glu Gly Lys Gly Pro Glu Leu Val Gly Pro Ser Glu Ser 
165 170 175 



Lys Pro Arg Gly 
180 

Leu Gin Pro Gin 
195 

Ala Tyr Gin Tyr 
210 

Glu Ser Gin Tyr 

225 

Ala Pro Tyr Pro 



Arg Ser Gly Gin 
260 

Lys Gin Gly Asp 
275 

Val Gin Ala Gly 

290 

Thr Arg Cys Glu 



Pro Ser Pro Leu 



Thr Gin Val Lys 
200 

Trp Pro Pro Ala 
215 

Gly Tyr Pro Gly 

230 

Gin Pro Pro Thr 
245 

Gly Gly Thr Leu 



Leu Glu Ala Trp 

280 

Glu Glu Thr Gin 

295 

Pro Phe Thr Met 



Pro Ala Gly Gin 
185 

Glu Asn Lys Thr 



Glu Leu Gin Tyr 

220 

Met Pro Pro Ala 

235 

Val Arg Leu Asn 
250 

His Ala Val lie 

265 

Arg Phe Leu Val 



Val Gly Ala Pro 
300 

Lys Met Leu 



Val Pro Val Thr 
190 

Gin Pro Pro Val 

205 

Leu Pro Pro Pro 



Leu Gin Gly Arg 
240 

Pro Thr Ala Ser 
255 

Asp Glu Ala Arg 
270 

lie Leu Gin Leu 

285 

Ala Arg Ala Glu 



